[V0][V1][Core] Add outlines integration for V1, and update V0 integration. #15975
NOTE: Can't be merged until the next version of outlines_core is released.
Thank you for the PR! I will review it this week.
I reviewed the V0 code path. One ask: please add tests for the path where the cache is disabled.
We should also update requirements/common.txt to the lowest supported version of outlines-core.
vllm/model_executor/guided_decoding/outlines_logits_processors.py
This pull request has merge conflicts that must be resolved before it can be |
First round of review. A few things need to be addressed here, but great progress so far.
vllm/model_executor/guided_decoding/outlines_logits_processors.py
Signed-off-by: Nathan Hoos <thwackyy.y@gmail.com>
db324d3 to 8022c7a
Merge conflicts with your last PR are resolved. The test failure also seems to be unrelated to any changes in this PR.
Hmm, maybe try merging from main?
@unaidedelf8777 I think only the logits processor tests in V0 are related to this PR (at least for the outlines case). PTAL
Head branch was pushed to by a user without write access
64c0086 to 3d4c13d
e70ee76 to 05023f6
I would try merging from main. One of the test failures I see should be fixed.
1fd1223 to d30598d
Just pushed an import path fix for the tool-use tests, so that should be happy now. The other failing tests seem unrelated to any changes here AFAICT.
@aarnphm Just because I didn't mention it in the last message: all of the V0 logits processor tests are green now.
Adds outlines as a guided decoding backend for V1, and updates the integration for V0.
The aim of this is threefold:
- Remove the dependency on `outlines`, and only use `outlines_core`.
- Use the `write_mask_into` method on `Guide` to write a bitmask in place for use in logits masking.
Because the dependency on `outlines` will be removed, support for grammar-based decoding with the outlines backend will also be removed (the CFG classes reside in the `outlines` package).
cc @aarnphm
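To illustrate the in-place bitmask idea mentioned in the description, here is a minimal pure-Python sketch. It does not use the real `outlines_core` API: `write_mask_into_sketch` and `apply_bitmask` are hypothetical helpers, and the 32-bit word layout is an assumption chosen for illustration. The point is only that the mask buffer is allocated once and overwritten on every decoding step, then used to set disallowed logits to negative infinity.

```python
import math

BITS_PER_WORD = 32  # assumed packing width, for illustration only


def write_mask_into_sketch(allowed_token_ids, mask):
    """Hypothetical stand-in for an in-place mask writer.

    Overwrites `mask` (a preallocated list of ints) so that bit i is set
    if and only if token id i is allowed. No new buffer is allocated.
    """
    for i in range(len(mask)):
        mask[i] = 0
    for tok in allowed_token_ids:
        mask[tok // BITS_PER_WORD] |= 1 << (tok % BITS_PER_WORD)


def apply_bitmask(logits, mask):
    """Set the logit of every token whose bit is 0 to -inf, in place."""
    for tok in range(len(logits)):
        word, bit = divmod(tok, BITS_PER_WORD)
        if not (mask[word] >> bit) & 1:
            logits[tok] = -math.inf


vocab_size = 40
# One buffer, sized once and reused across decoding steps.
mask = [0] * ((vocab_size + BITS_PER_WORD - 1) // BITS_PER_WORD)
logits = [0.5] * vocab_size

write_mask_into_sketch([3, 35], mask)  # pretend the guide allows tokens 3 and 35
apply_bitmask(logits, mask)
```

After this runs, only tokens 3 and 35 keep a finite logit, so sampling can only pick one of them. A real integration would presumably operate on tensors rather than Python lists; the list version is used here only to keep the sketch self-contained.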